04:00
2026-06-17
arxiv.org
large-language-models
Not Truly Multilingual: Script Consistency as a Missing Dimension in VLM Evaluation
Researchers introduced PuMVR, a benchmark of 1,000 image-text pairs across Punjabi's three scripts, and found that state-of-the-art vision-language models show a systematic Script Gap, with accuracy dโฆ